What's in a Thesaurus?

Authors

  • Adam Kilgarriff
  • Colin Yallop
Abstract

We first describe four varieties of thesaurus: (1) Roget-style thesauruses, produced to help people find synonyms when they are writing; (2) WordNet and EuroWordNet; (3) thesauruses produced manually to support information retrieval systems; and (4) thesauruses produced automatically from corpora. We then contrast thesaurus word senses with dictionary word senses and present a small experiment on polysemy in relation to thesaurus structure. It has sometimes been assumed that dictionary senses of a word that are close in meaning will be near neighbours in the thesaurus. We explore this hypothesis using the hierarchical structure of WordNet 1.5 and a mapping between WordNet senses and the senses of another dictionary. The experiment shows that pairs of ‘lexicographically close’ meanings are frequently found in different parts of the hierarchy.
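
The notion of two senses being "near neighbours" in a hierarchy can be made concrete with a small sketch. The Python below is only an illustration under stated assumptions: the toy hierarchy, the sense labels and the function names `path_to_root` and `tree_distance` are hypothetical, the data is not taken from WordNet 1.5, and edge-counting distance is one possible closeness measure, not necessarily the one used in the paper.

```python
from typing import Dict, List, Optional

# Toy hypernym hierarchy (child -> parent). Hypothetical data for illustration,
# NOT taken from WordNet 1.5.
TOY_HYPERNYMS: Dict[str, Optional[str]] = {
    "entity": None,
    "object": "entity",
    "abstraction": "entity",
    "artifact": "object",
    "institution": "abstraction",
    "building": "artifact",
    "school_building": "building",        # 'school' as a physical building
    "school_institution": "institution",  # 'school' as an organisation
}


def path_to_root(node: str, parents: Dict[str, Optional[str]]) -> List[str]:
    """Return the chain of nodes from `node` up to the root, inclusive."""
    path: List[str] = []
    current: Optional[str] = node
    while current is not None:
        path.append(current)
        current = parents[current]
    return path


def tree_distance(a: str, b: str, parents: Dict[str, Optional[str]]) -> int:
    """Count the edges on the path from `a` to `b` via their lowest common ancestor."""
    steps_from_b = {n: i for i, n in enumerate(path_to_root(b, parents))}
    for steps_from_a, n in enumerate(path_to_root(a, parents)):
        if n in steps_from_b:
            return steps_from_a + steps_from_b[n]
    raise ValueError("nodes share no common ancestor")


# Two senses of one word that a dictionary might treat as closely related
# can still sit in distant branches of the hierarchy.
print(tree_distance("school_building", "school_institution", TOY_HYPERNYMS))
print(tree_distance("school_building", "building", TOY_HYPERNYMS))
```

In this toy data the two senses of "school" meet only at the root, so their distance is much larger than the distance between a sense and its direct hypernym. That is the kind of pattern the experiment looks for among 'lexicographically close' sense pairs.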

Similar resources

A survey of the state of Persian thesaurus management and presentation software

The current study investigates software for managing and providing Persian thesauri. Using a descriptive survey method, we analyzed five thesaurus management software packages, namely “Islamic Sciences Thesaurus”, “Thesaurus Builder”, “Pars Azarakhsh”, “Ghamoos” and the “published version of Ebrahimpoor Thesaurus”, along with four software packages for providing thesau...

A comparative study of the thesauri of Islamic teachings and Quranic sciences

This study compares the strengths and weaknesses of the Islamic teachings thesaurus and the Quranic sciences thesaurus. In today's society, where documents are kept electronically, retrieving and disseminating information for the development of research is of much greater importance than storing documents, and the thesaurus is the basis for indexing in various sciences. One of the solutions fo...

Problems of thesaurus construction in Iran from the viewpoint of thesaurus makers

Introduction: The present research studies the theoretical foundations of thesaurus construction before and after the internet and identifies the problems of thesaurus construction in Iran from the point of view of thesaurus makers and translators of the published thesauri. Methods: The research population consisted of six thesaurus makers (AbdolHossein Azaragn, Abbas Hori, Fatemeh Rahadoost, Faribor...

A comparative study of the semantic relationships, formal structure and management system of the Technical-Engineering and NAMA thesauri

Purpose: Thesauri, as important tools in information storage and retrieval systems, play a significant role in optimizing database search, so published thesauri should follow standards as closely as possible. I examined and compared two important thesauri on the basis of ANSI/NISO Z39.19-2005. Methodology: This study is an analytical and applied survey. The study population was t...

What's beyond query by example?

Over the last ten years, the crucial problem of information retrieval in multimedia documents has boosted research in the field of visual appearance indexing and retrieval by content. In the early years of this research, the concept of “query by visual example” (QBVE) was proposed and shown to be relevant for visual information retrieval. It is obvious that QBVE is not able to satis...

Publication date: 2000